Abstract

The decay $B_c^+ \to \psi(2S)\pi^+$ with $\psi(2S) \to \mu^+\mu^-$ is observed with a significance of
5.2$\sigma$ using pp collision data corresponding to an integrated luminosity of 1.0 fb$^{-1}$
collected by the LHCb experiment. The branching fraction of $B_c^+ \to \psi(2S)\pi^+$ decays
relative to that of the $B_c^+ \to J/\psi\pi^+$ mode is measured to be

$$\frac{\mathcal{B}(B_c^+ \to \psi(2S)\pi^+)}{\mathcal{B}(B_c^+ \to J/\psi\pi^+)} = 0.250 \pm 0.068\,(\text{stat}) \pm 0.014\,(\text{syst}) \pm 0.006\,(\mathcal{B}).$$

The last term is the uncertainty on the ratio $\mathcal{B}(\psi(2S) \to \mu^+\mu^-)/\mathcal{B}(J/\psi \to \mu^+\mu^-)$.
The $B_c^+$ meson, discovered by CDF at the Tevatron [1], is the only known meson composed
of two flavours of heavy quarks, charm and beauty. Both quarks can decay via the
weak interaction with the other quark acting as a spectator, so a wide
range of decay channels is possible. However, only a few of these channels have been
experimentally observed [1-4]. The LHC opens a new era for $B_c^+$ physics, with an expected
production cross-section of ~0.4 µb at centre-of-mass energy $\sqrt{s} = 7$ TeV for the $B_c^+$
meson [5,6]. The LHCb experiment has observed the decay $B_c^+ \to J/\psi\pi^+$ [7], and new
channels such as $B_c^+ \to J/\psi\pi^+\pi^+\pi^-$ [8] have started to emerge.

We report here the first observation of the decay $B_c^+ \to \psi(2S)\pi^+$ with $\psi(2S) \to \mu^+\mu^-$
and the measurement of the ratio of branching fractions
$\mathcal{B}(B_c^+ \to \psi(2S)\pi^+)/\mathcal{B}(B_c^+ \to J/\psi\pi^+)$. The inclusion of charge conjugate modes
is implied throughout the paper. The relativistic quark model [9] and several other models [10-13]
make various theoretical predictions for this ratio of branching fractions. As a two-body decay,
$B_c^+ \to \psi(2S)\pi^+$ is under better control theoretically than $B_c^+ \to J/\psi\pi^+\pi^-\pi^+$, and
therefore this measurement is particularly useful to test the models of $B_c^+$ decays. The
$B_c^+ \to J/\psi\pi^+$ decay mode is chosen as the normalisation channel because of its identical
final state and similar event topology. Both channels take advantage of the large trigger
efficiency due to the two muons in the final state.

The analysis is based on pp collision data corresponding to an integrated luminosity of
1.0 fb$^{-1}$ at $\sqrt{s} = 7$ TeV collected with the LHCb detector in 2011. The detector is a
single-arm forward spectrometer covering the pseudorapidity range $2 < \eta < 5$, designed
for the study of particles containing b or c quarks. The detector includes a high precision 
tracking system consisting of a silicon-strip vertex detector surrounding the pp interaction 
region, a large-area silicon-strip detector located upstream of a dipole magnet with a 
bending power of about 4 Tm, and three stations of silicon-strip detectors and straw
drift tubes placed downstream. The combined tracking system has momentum resolution
$\Delta p/p$ that varies from 0.4% at 5 GeV/$c$ to 0.6% at 100 GeV/$c$, and impact parameter (IP)
resolution of 20 µm for tracks with high transverse momentum ($p_T$). Charged hadrons are
identified using two ring-imaging Cherenkov detectors and good kaon-pion separation is
achieved for tracks with momentum between 5 GeV/$c$ and 100 GeV/$c$. Photon, electron and
hadron candidates are identified by a calorimeter system consisting of scintillating-pad and 
preshower detectors, an electromagnetic calorimeter and a hadronic calorimeter. Muons 
are identified by a system composed of alternating layers of iron and multiwire proportional 
chambers. The trigger system [15] consists of a hardware stage, based on information from
the calorimeter and muon systems, followed by a software trigger that applies a full event 
reconstruction and reduces the event rate from 1 MHz to around 3 kHz. 

Candidate $B_c^+ \to \psi\pi^+$ decays with $\psi \to \mu^+\mu^-$, where $\psi$ denotes $J/\psi$ or $\psi(2S)$, are
selected by requiring a single muon or dimuon with high $p_T$ in the hardware trigger. In the
software trigger, a charged particle is required to have $p_T > 1.7$ GeV/$c$, or $p_T > 1$ GeV/$c$
if identified as a muon; alternatively a dimuon trigger requires two oppositely charged
muons with $p_T > 500$ MeV/$c$, an invariant mass of the muon pair $M_{\mu^+\mu^-} > 2.95$ GeV/$c^2$,
and a decay length significance of the muon track pair with respect to the primary
vertex greater than 5.






Further offline selections require both muons to have $p_T > 550$ MeV/$c$ and a track fit
$\chi^2_{\rm tr}$ per degree of freedom ($\chi^2_{\rm tr}$/ndf) of less than 5. The mass of the $\psi$ candidate is required
to be within a window of 100 MeV/$c^2$ centred around the known $\psi$ mass (3686 MeV/$c^2$
for $\psi(2S)$ and 3097 MeV/$c^2$ for $J/\psi$) [16]. The $\psi$ vertex fit $\chi^2_{\rm vtx}$/ndf is required to be less
than 20, and the $\psi$ decay length significance larger than 5.

The $B_c^+$ candidate is reconstructed from the $\psi$ and a bachelor pion. The pion is
required to have $p_T > 500$ MeV/$c$, a track fit $\chi^2_{\rm tr}$/ndf < 10 and IP $\chi^2_{\rm IP}$ with respect to the
primary interaction greater than 4. The IP $\chi^2_{\rm IP}$ is defined as the difference between the
$\chi^2$ of the primary vertex reconstructed with and without the considered track. The $B_c^+$
candidate is required to have a mass within 0.5 GeV/$c^2$ of the world average value [16]
and a vertex fit $\chi^2_{\rm vtx}$/ndf < 16.

A boosted decision tree (BDT) [17], trained on data and simulation, is used to perform
further background suppression. The pp collisions are simulated using Pythia 6.4 [18]
with a specific LHCb configuration [19]. The $B_c^+$ mesons are generated through the
dominant hard subprocess $gg \to B_c^+ + b + \bar{c}$ with the dedicated generator Bcvegpy [20,21].
Decays of hadronic particles are described by EvtGen [22], in which final state radiation is
generated using Photos [23]. The interaction of the generated particles with the detector
and its response are implemented using Geant4 [24] as described in Ref. [25].

The choice of the variables used to train the BDT is based on two considerations: their 
power to separate signal and background, and the similarity of the distributions for the 
$B_c^+ \to J/\psi\pi^+$ and $B_c^+ \to \psi(2S)\pi^+$ candidates that causes the systematic uncertainties in
the selections to cancel when the ratio of branching fractions is determined. The BDT
input variables are: the $\pi^+$ IP $\chi^2_{\rm IP}$; the $B_c^+$ vertex fit $\chi^2_{\rm vtx}$/ndf; the $B_c^+$ IP $\chi^2_{\rm IP}$; the $\chi^2$ of
the distance between the $B_c^+$ vertex and the associated primary vertex; the $p_T$ of the $B_c^+$
candidate; and the $\chi^2$ from a refit of the $B_c^+$ decay vertex [26] using a $J/\psi$ or $\psi(2S)$ mass
constraint and a constraint that the $B_c^+$ candidate points to the primary vertex.

The BDT is trained using a $B_c^+ \to J/\psi\pi^+$ simulation sample for the signal and
sidebands from the $B_c^+ \to J/\psi\pi^+$ mass spectrum ($6164 < M_{J/\psi\pi^+} < 6206$ MeV/$c^2$ or
$6346 < M_{J/\psi\pi^+} < 6388$ MeV/$c^2$) for the background. The trained BDT is then applied to the
data, and a signal estimator is calculated for each candidate; a large value indicates a
signal-like candidate. The cut on the estimator is optimised to maximise the $B_c^+ \to \psi(2S)\pi^+$ signal
significance. The BDT selection efficiencies, estimated from simulation, for $B_c^+ \to \psi(2S)\pi^+$
and $B_c^+ \to J/\psi\pi^+$ candidates are 35.8% and 37.2% respectively, and the fraction of accepted
background is $4.8 \times 10^{-4}$ as estimated from the sideband data.

After the BDT selection, it is further required that the unconstrained dimuon invariant
mass is in the range $3030 < M_{\mu^+\mu^-} < 3170$ MeV/$c^2$ for $J/\psi$ and $3620 < M_{\mu^+\mu^-} < 3760$ MeV/$c^2$
for $\psi(2S)$. Information on particle identification for pions and kaons is also used to suppress
the reflection background due to $B_c^+ \to J/\psi K^+$ decays. Figure 1 shows the invariant mass
distributions of the $B_c^+ \to J/\psi\pi^+$ and $B_c^+ \to \psi(2S)\pi^+$ candidates.

The relative branching fraction is calculated using 



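$$\frac{\mathcal{B}(B_c^+ \to \psi(2S)\pi^+,\ \psi(2S) \to \mu^+\mu^-)}{\mathcal{B}(B_c^+ \to J/\psi\pi^+,\ J/\psi \to \mu^+\mu^-)} = \frac{N(B_c^+ \to \psi(2S)\pi^+)}{N(B_c^+ \to J/\psi\pi^+)} \cdot \frac{\epsilon(B_c^+ \to J/\psi\pi^+)}{\epsilon(B_c^+ \to \psi(2S)\pi^+)}, \qquad (1)$$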







Figure 1: Invariant mass distributions of candidates reconstructed as (a) $B_c^+ \to J/\psi\pi^+$
and (b) $B_c^+ \to \psi(2S)\pi^+$. Points with error bars (black) show the data, the thick solid
line (blue) represents the fit to the data, the dashed line (red) the signal distribution,
the dotted line (green) the combinatorial background, the dot-dashed line (purple) the
partially reconstructed background, and the thin solid line (light blue) the background
from the $B_c^+ \to J/\psi K^+$ channel.



where $N$ is the number of selected signal events and $\epsilon$ is the total efficiency.

The signal yields are obtained by performing an extended maximum likelihood fit to
the $B_c^+$ mass spectra in Fig. 1. The signal is modelled with a double-sided Crystal Ball
function [27] with the tail parameters on both sides determined from simulation. The
main background component for both channels is combinatorial and is modelled using an
exponential function. At the lower end of the mass spectrum, the contribution from the
partially reconstructed background is modelled by an ARGUS function [28] convolved with
a Gaussian distribution. For the $B_c^+ \to J/\psi\pi^+$ decay, the Cabibbo suppressed channel
$B_c^+ \to J/\psi K^+$ also contributes, and is fitted with a double-sided Crystal Ball function with
all parameters fixed to values obtained from simulation. The observed signal yields are
595 ± 29 for $B_c^+ \to J/\psi\pi^+$ and 20 ± 5 for $B_c^+ \to \psi(2S)\pi^+$. Therefore the ratio of yields is

$$\frac{N(B_c^+ \to \psi(2S)\pi^+)}{N(B_c^+ \to J/\psi\pi^+)} = 0.034 \pm 0.009\,(\text{stat}).$$
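
As a cross-check of the arithmetic, the quoted ratio can be reproduced from the two fitted yields. The sketch below assumes that the two yield uncertainties are uncorrelated and propagates them in quadrature; that assumption and the helper name `ratio_with_uncertainty` are illustrative only and are not taken from the analysis, where the uncertainty comes from the fit itself.

```python
import math


def ratio_with_uncertainty(num, num_err, den, den_err):
    """Return (ratio, uncertainty) for num/den, assuming uncorrelated errors."""
    ratio = num / den
    # Relative uncertainties add in quadrature for a quotient.
    rel_err = math.hypot(num_err / num, den_err / den)
    return ratio, ratio * rel_err


# Fitted signal yields quoted in the text:
# 20 +/- 5 for psi(2S) pi+ and 595 +/- 29 for J/psi pi+.
ratio, err = ratio_with_uncertainty(20.0, 5.0, 595.0, 29.0)
print(f"{ratio:.3f} +/- {err:.3f}")  # prints "0.034 +/- 0.009"
```

The relative uncertainty is dominated by the 25% uncertainty on the small $\psi(2S)\pi^+$ yield.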



The total efficiency is the product of the detector acceptance, and the trigger,
reconstruction and selection efficiencies. Each contribution has been determined using simulated
events for the two channels, and the ratio of the total efficiencies has been evaluated to be

$$\frac{\epsilon(B_c^+ \to \psi(2S)\pi^+)}{\epsilon(B_c^+ \to J/\psi\pi^+)}$$

where the uncertainty is due to the limited size of the simulated sample. 

Several sources of systematic uncertainty have been considered. The measured ratio
of signal yields is expected to be independent of the BDT selection, given that the
distributions of training variables are very similar for the two channels. The ratio of signal
yields is measured for different cuts on the BDT response, and is constant within the
statistical uncertainties. The average of these ratios differs from the nominal value by
4.5%, which is taken as the systematic uncertainty due to the BDT selection.

Table 1: Relative systematic uncertainties.

Component                 Value (%)
BDT selection             4.5
Signal shape              1.7
Background shape          2.9
Simulation sample size    0.9
Total                     5.7

The $B_c^+ \to \psi(2S)\pi^+$ signal is fitted with a double-sided Crystal Ball function.
Alternatively we determine the signal shape directly from the simulation using kernel
estimation [29], and convolve it with a Gaussian function to take into account the detector
resolution while allowing the mean of the mass to vary. This results in a 1.7% difference
with respect to the nominal ratio, which is taken as the uncertainty due to the signal
shape.

To consider the contribution from partially reconstructed background, the background is
fitted with an exponential function within a narrower range of the $B_c^+$ candidate mass
(6164 to 6500 MeV/$c^2$).
This results in a 2.9% change with respect to the nominal fit, and is assigned as a systematic 
uncertainty. 

The statistical uncertainty on the simulation when estimating the ratio of efficiencies
leads to an uncertainty of 0.9% on the ratio of branching fractions. The difference between
data and simulation introduces a systematic uncertainty, especially from variables used
as input for the BDT. The distributions of these variables in simulation and data are
compared, after the background is subtracted from the data using the sPlot technique [30].
The difference is found to be negligible compared to the statistical fluctuation.

A summary of systematic uncertainties is given in Table 1. The total systematic
uncertainty is 5.7%, with the most significant contribution coming from the BDT
selection. Taking the systematic uncertainty into account and using the likelihood ratio test
$\sqrt{-2\log(\mathcal{L}_B/\mathcal{L}_{S+B})}$ [31], the significance of the $B_c^+ \to \psi(2S)\pi^+$ decay is estimated to be
5.2$\sigma$, where $\mathcal{L}_B$ and $\mathcal{L}_{S+B}$ represent the likelihood of the background-only hypothesis and
the signal-plus-background hypothesis respectively.
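
The quoted total of 5.7% is consistent with adding the four components of Table 1 in quadrature. The sketch below makes that check explicit; treating the components as independent is an assumption made here for illustration, and the names `systematics` and `quadrature_sum` are hypothetical.

```python
import math

# Relative systematic uncertainties in percent, as listed in Table 1.
systematics = {
    "BDT selection": 4.5,
    "Signal shape": 1.7,
    "Background shape": 2.9,
    "Simulation sample size": 0.9,
}


def quadrature_sum(values):
    """Combine independent uncertainties by summing their squares."""
    return math.sqrt(sum(v * v for v in values))


total_percent = quadrature_sum(systematics.values())
print(f"Total: {total_percent:.1f}%")  # prints "Total: 5.7%"

# Applied to the measured ratio of 0.033, this gives the absolute
# systematic uncertainty quoted with the result.
syst_abs = 0.033 * total_percent / 100.0
print(f"{syst_abs:.3f}")  # prints "0.002"
```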

In summary, a search for the decay $B_c^+ \to \psi(2S)\pi^+$ has been performed using a data
sample corresponding to an integrated luminosity of 1.0 fb$^{-1}$ at $\sqrt{s} = 7$ TeV collected
by LHCb in 2011. The signal yield is 20 ± 5 candidates, marking the first observation of
this decay channel. Using $B_c^+ \to J/\psi\pi^+$ as the normalisation channel, the ratio of branching
fractions is measured to be

$$\frac{\mathcal{B}(B_c^+ \to \psi(2S)\pi^+,\ \psi(2S) \to \mu^+\mu^-)}{\mathcal{B}(B_c^+ \to J/\psi\pi^+,\ J/\psi \to \mu^+\mu^-)} = 0.033 \pm 0.009\,(\text{stat}) \pm 0.002\,(\text{syst}).$$



Furthermore, taking $\mathcal{B}(J/\psi \to e^+e^-) = (5.94 \pm 0.06)\%$ and $\mathcal{B}(\psi(2S) \to e^+e^-) =
(7.73 \pm 0.17) \times 10^{-3}$ [16] and assuming universality of the electroweak interaction, we
obtain

$$\frac{\mathcal{B}(B_c^+ \to \psi(2S)\pi^+)}{\mathcal{B}(B_c^+ \to J/\psi\pi^+)} = 0.250 \pm 0.068\,(\text{stat}) \pm 0.014\,(\text{syst}) \pm 0.006\,(\mathcal{B}),$$

where the last term accounts for the uncertainty on $\mathcal{B}(\psi(2S) \to \mu^+\mu^-)/\mathcal{B}(J/\psi \to \mu^+\mu^-)$.
This result favours the prediction made by the relativistic quark model [9] in comparison
with the other models.
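
The conversion from the dimuon-inclusive ratio to the final ratio can be sketched numerically. The code below uses only the rounded values quoted in the text and assumes, as the text does, that the dielectron branching fractions can stand in for the dimuon ones; it further assumes the two branching-fraction uncertainties are uncorrelated. All variable names are illustrative.

```python
import math

# Rounded inputs quoted in the text.
R_MUMU, R_STAT, R_SYST = 0.033, 0.009, 0.002      # ratio including psi -> mu+ mu-
BF_JPSI_EE, BF_JPSI_EE_ERR = 5.94e-2, 0.06e-2     # B(J/psi -> e+ e-)
BF_PSI2S_EE, BF_PSI2S_EE_ERR = 7.73e-3, 0.17e-3   # B(psi(2S) -> e+ e-)

# Universality assumption: B(psi -> mu+ mu-) is taken equal to B(psi -> e+ e-),
# so dividing out the dimuon branching fractions is a single scale factor.
scale = BF_JPSI_EE / BF_PSI2S_EE
scale_rel_err = math.hypot(BF_JPSI_EE_ERR / BF_JPSI_EE,
                           BF_PSI2S_EE_ERR / BF_PSI2S_EE)

ratio = R_MUMU * scale
stat = R_STAT * scale
syst = R_SYST * scale
bf_err = ratio * scale_rel_err  # the "(B)" term

print(f"{ratio:.3f} +/- {stat:.3f} (stat) +/- {syst:.3f} (syst) +/- {bf_err:.3f} (B)")
```

With these rounded inputs the central value comes out slightly above the quoted 0.250 (near 0.254), and the statistical and systematic terms differ in the last digit; this is presumably because the quoted result was computed from unrounded intermediate values. The $(\mathcal{B})$ term reproduces the quoted 0.006.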